arXiv:1503.08686vl [math.ST] 30 Mar 2015 


An eigenproblem approach to 

optimal equal-precision sample allocation in subpopulations 

Jacek Wesolowski* Robert Wicczorkowskk 
March 31, 2015 


Abstract 

Allocation of samples in stratified and/or multistage sampling is one of the central issues of 
sampling theory. In a survey of a population often the constraints for precision of estimators of sub¬ 
populations parameters have to be taken care of during the allocation of the sample. Such issues 
are often solved with mathematical programming procedures. In many situations it is desirable to 
allocate the sample, in a way which forces the precision of estimates at the subpopulations level to be 
both: optimal and identical, while the constraints of the total (expected) size of the sample (or sam¬ 
ples, in two-stage sampling) are imposed. Here our main concern is related to two-stage sampling 
schemes. We show that such problem in a wide class of sampling plans has an elegant mathematical 
and computational solution. This is done due to a suitable definition of the optimization problem, 
which enables to solve it through a linear algebra setting involving eigenvalues and eigenvectors of 
matrices defined in terms of some population quantities. As a final result we obtain a very simple and 
relatively universal method for calculating the subpopulation optimal and equal-precision allocation 
which is based on one of the most standard algorithms of linear algebra (available e.g. in R soft¬ 
ware). Theoretical solutions are illustrated through a numerical example based on the Labour Force 
Survey. Finally, we would like to stress that the method we describe, allows to accommodate quite 
automatically for different levels of precision priority for subpopulations. 


1 Introduction 

Consider a population U partitioned into subpopulations U\,..., Uj, that is U = U j=i Ui and UiHUj = 
0 for i 7 ^ j, i.j = 1,... , J. Assume that we are interested in estimation of means of a variable y in 
all subpopulations. In each [/* a sample of n, elements, i = 1,..., J, is chosen according to simple 
random sampling without replacement (SRSWOR). Assume additionally that the size of the total sample 
n = n\ + ... + nj is fixed. A natural requirement is to allocate the sample among subpopulations in such 
a way that the precision (here and throughout the paper understood as coefficients of variation, CV’s) of 
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the estimators in each of the subpopulations arc the same. Throughout this paper by equal-precision we 
always mean equal-precision in subpopulations. That is we want to have 

Tj = (ht - 177 ) 7j = const =: T for all j = 1,..., J, 

where 7 y is the CV of V in Uj and Nj = #Uj,j = 1 Expressing rij in terms of Tj, j = 1,..., J, 

the constraint on the size of the total sample n = n\ + ... + nj gives the equation 


J 


n = Y.] 

3= 1 


JV^T l_ 
7 J+TNj 


with unknown T. It is easy to see that the above equation has a unique solution, which can be easily 
computed numerically (however no analytical explicit formula is available). Obviously, such a solution, 

Nj 7 ? 

T*, gives the optimal allocation rij = 737777 -' 3 = 1,for more details see, e.g. Lednicki and 
Wesolowski (1994). 

On the other hand if one imposes requirements on CV’s of estimators in subpopulations, that is when 
Tj, j = 1, ..., J, are given (not necessarily identical) there is no freedom in the sense that they determine 
uniquely the total sample size. If instead one assumes only the restriction that CV’s of domain mean es¬ 
timators arc bounded from above by (possibly) different constraints Tj, j = 1,..., J, the minimization 
of the total sample size is a valid question. It has been solved recently (with additional constraint on the 
CV of the estimator of the population mean) for stratified SRSWOR by Choudhry, Rao and Hidiroglou 
(2012) through a nonlinear programming (Newton-Raphson) procedure. These authors followed earlier 
application of such procedures to optimal allocation of the sample among strata for the population means 
estimation in multivariate setting as proposed in Huddlestone, Claypool and Hocking (1970) and Bethel 
(1989) (see also Ch. 12.7 in Sarndal, Swensson and Wretman (1992)). For an alternative numerical 
method (Nelder-Mead simplex method) used to sample allocation (and strata construction) under multi¬ 
variate setting, where subpopulations were also taken under account - see Lednicki and Wieczorkowski 
(2003). 

In an allocation problem for stratified two-stage sampling, when only the optimality of the estimator 
for the population as a whole is considered, traditionally a single constraint based on the expected total 
cost is imposed - see e.g. Ch. 2.8 in Sarndal, Swensson and Wretman (1992) or Ch. 10.9 and Ch. 
10.10 in Cochran (1977). Such issues were also considered more recently - see e.g. Clark and Steel 
(2000), Khan et al (2006) and Clark (2009) and references therein. From the practical point of view, 
the total cost of the two-stage survey may be difficult to model, therefore alternatively, constraints in 
terms of (expected) total sizes of SSU’s (secondary sampling units) and PSU’s (primary sampling units) 
may be imposed. Under such constraints, we arc interested in the allocation of PSU’s and SSU’s which 
guarantees optimal and equally precise estimators of means in all the domains. We allow stratification on 
both stages and propose quite general approach to the problem which is valid not only for simple random 
sampling without replacement in strata. Suitable definition of the minimization issue allows to reduce it 
to an eigenproblem for a rank-two perturbation of a diagonal matrix. Such an approach, in the context 
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of allocation of samples, was for the first time proposed in Niemiro and Wesolowski (2001) (denoted 
NW in the sequel), where stratification was allowed only either at the first or at the second stage with 
SRSWOR on both stages only. Moreover, some technical conditions were required in that paper, which 
allowed to use the famous Perron-Frobenius theorem in the proof of the main result. Such approach was 
applied in agricultural surveys in Kozak (2004) and in a forestry survey in Kozak and Zielinski (2005). 
Also it has been slightly developed theoretically by allowing CV’s in domains to be of the form KjT, 
j = 1 with known coefficients (levels of priority) Kj, j = 1and unknown optimal T, in 

Kozak, Zielinski and Singh (2008). 

The eigenvalue-eigenvector solution we present here is quite universal. In particular, it covers also 
classical Neyman optimal allocation as a boundary case of J = 1, that is the case of one subpopula¬ 
tion (see, e.g. Sarndal, Swensson, Wretman (1992), Ch. 3.7.3), or related solution in the two- stage 
sampling (see e.g. Sarndal, Swensson, Wretman (1992), Ch. 12.8.1). It includes also equal-precision 
allocation derived in Lednicki and Wesolowski (1994). Though these issues are rather standard and well- 
understood, as far as we know, they have never been embedded into and eigenproblem setting. Never¬ 
theless, our main concern here is two-stage sampling schemes. The point of departure for the present 
paper is that of NW, where optimal equal-precision allocation was considered in two special cases of 
two-stage sampling: (1) stratified SRSWOR at the first stage and SRSWOR at the second, (2) SRSWOR 
at the first stage and stratified SRSWOR at the second. In that paper eigenproblem approach together 
with the Perron-Frobenius theory of positive matrices were used. Here we develop a similar approach 
(though we go beyond the Perron-Frobenius theory) to a wider class of two-stage sampling schemes and 
with less restrictive requirements for the population characteristics. It is given in Section 3, where an 
eigenvalue-eigenvector solution of a general minimization problem leads to optimal and equal precision 
estimators in subpopulations for some stratified sampling plans. Section 2 is a kind of a warm-up: the 
proposed method is introduced in rather standard settings of single-stage sampling. A numerical exam¬ 
ple comparing the eigenvalue-eigenvector allocation with the standard one in the Polish Labour Force 
Survey is discussed in Section 4. Conclusions, involving incorporation of different levels of priority in 
the proposed method, are given in Section 5. 

2 Equal-precision optimal allocation in single-stage sampling 

In this section we consider single-stage stratified simple random sampling. The main puipose of this 
section is to give a friendly introduction to the approach via eigenvalues and eigenvectors, since in this 
case the proofs are less complicated than in the case of two-stage sampling. It also confirms relative 
universality of such purely linear-algebraic solution to the allocation problem, when the constraint of 
equal-precision is imposed. Nevertheless, in a single-stage setting one can use alternatively a direct 
numerical method as described in Remarks 12.21 or a combination of the direct numerical approach and 
the Neyman optimal allocation method as explained in l2.3l below. We would like to emphasize that such 
approaches are not possible in the two-stage setting considered in Section 3. 

We start with a general minimization problem, which, we show, can be treated through linear algebra 
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methods. 


2.1 Minimization problem - generalities 

Consider strictly positive numbers: Cj, Aj j / l , h = 1,..., Hj, j = 1..... d and x. Denote 


Hi 


a = (aj, j = l,...,d) = ^ [^2 V A j,h, j = l,-,d\ , c = (cj, j = 1 ,..., d). (1) 


,h =1 


Let 


( 2 ) 


D = aa T — diag(c). 

Theorem 2.1. Assume that D as defined in ©, has the unique simple positive eigenvalue A and let 
v = (vi,..., v,j) be a respective eigenvector. Then the problem of minimization of 


Hi 


T = 3 = 1,■■■,*, 


h=l 


where Xj ^ > 0, h = 1,... . Hj, j = 1 ,... ,d, under the constraint 

d h 3 

x bh = x , 

j =1 h= 1 


where x is a given positive number, has the solution 

x j,h. = x 


\i v A j } h _ 

Eti v k Egil y/AT a ‘ 


Moreover, T = A, the unique positive eigenvalue of matrix D. 

Proof. For x = h = 1,..., Hj, j = 1,... ,d) consider the Lagrange function 


Hi 


d ^ 


x(T, £ ) = r+y> E£Ec,-t 


j=i \h =t 


j =1 h= 1 


Differentiate with respect to Xj } h to get equations for stationary points 


dxj,h 


= P-Pj^ = 0 . 


j,h 


Therefore p/pj >0 and 


Xj,h = y-jt V A j,h, h = l,...,H j: j = l,...,d. 


(3) 


(4) 
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Plugging it to © we obtain 


d Hj 

X = Y1 v i X! VAhhi (5) 

j =1 h= 1 

where Vj = \J /y, 7 ///., j = 1,... ,d. Now the constraint © gives 

Hj 

X V A j,h - c j v j = Tv j > j = 1,..., d. 

/i=i 

By © it can be written as 

/ d H k \ Hj 

* I X X X - CjVj = Tvj, j = l,...,d. 

\k=l g=l / h= 1 

Alternatively, it can be written as 

Dv = Tv, 

where the matrix D is defined in ©. That is 0 < T = A is the unique positive eigenvalue and v is the 
eigenvector related to T. 

To show that the eigenvector v attached to the eigenvalue A has all coordinates of the same sign we 
use the celebrated Perron-Frobenius theorem: If A is a matrix with all strictly positive entries then there 
exists a simple positive eigenvalue p such that p > \u\ for any other eigenvalue v of A. The respective 
eigenvector (attached to p) has all entries strictly positive (up to scalar multiplication) - see e.g. Kato 
(1981), Th. 7.3 in Ch. 1. 

Fix a number a > maxi <j<j Cj > 0. The matrix D + al, where I is the identity matrix, has all 
entries strictly positive. For any eigenvalue 5j of D and respective eigenvector Wj 

(D + al)w_j = (■Sj + a)wj, j = 1,..., d. 

That is Sj + a, and Wj, j = 1..... d arc respective eigenvalues and eigenvectors of the matrix D + al. 
By the Perron-Frobenius theorem, there exists j 0 such that 6 J0 + a > \5j + a\ for any j and respective 
eigenvector Wj Q has all entries of the same sign. Consequently, Sj 0 + a > Sj + a, and thus Sj 0 > Sj 
for any j. Therefore, by assumption that A is the unique simple positive eigenvalue of D it follows that 
T = A = Sj 0 and the respective eigenvector v = Wj 0 has all entries of the same sign. □ 

Proposition 2.2. Let a, c € (0, oo)^ be such that 


Ef > L 


i=1 


Then the matrix D defined in © has a unique simple positive eigenvalue A. 


( 6 ) 
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Proof. For any d x d Flermitian matrix M denote by A i(M), i = 1 ,d, non-decreasingly ordered 
eigenvalues of M. Recall the Weyl inequalities (see e.g. Th. 4.3.1 in Florn and Johnson (1985)): Let A 
and B be Hennitian d x d matrices. Then 

4 4- B ) < A&(A) + A d{B) V k = 1,..., d. (7) 


Note first that D, as rank one perturbation of diagonal matrix, is non-singular and thus all eigenvalues 
of D are non-zero. Since aa T is non-negative definite of rank 1, we have A j (aa r ) = 0 for j = 1..... c/— 
1. Thus © with A = aa T and B = —diag(c) for k = d — 1 implies Xd-i(D) < 0 since all eigenvalues 
of B are negative and all eigenvalues of D are non-zero. 

Therefore, 

sgn det D = (-1 ) d_1 sgn(A d (.D)). (8) 

On the other hand expanding determinant of D 


det D = det 


oj — C\ O1O2 
0102 «2 — c 2 


aid'd a2dd 


a 1 ad 
a2dd 

a d - °d J 


and using the fact that oaf is of rank one we obtain 

d d d 


k =1 


k =1 


i= 1 


det d= ( ~i) d 1 yi n ° k+ ( _i ) d n ° k= n Cfc ( 1 _ yi s- 

i=l k =1 
k^i 

Comparing the above formula with (l29l) we have 


sgn(A d (D)) 


—sgn 


1 - V ^ 

ci 


i= 1 


Due to (1281) we get A := A d(D) > 0. 


□ 


2.2 Application to stratified simple random sampling 

The population U = {1,... , N} consists of subpopulations U\,.... Uj, that is U = U/=i Uj and 
Ui 0 Uj = % whenever i f j, i.j = 1 ,J. Consider a non-negative variable y in this population, 
that is let = y(k), k € U. We want to estimate the total value of y in each of subpopulations, that is 

we are interested in tj = Ylkeu- J = 1 ,,J. We use stratified simple random sampling without 

^ . 

replacement (SSRSWOR) in each subpopulation. That is Uj = (J/, L\ Ufh . Uj,h Cl Uj t9 = 0 for any 
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h g, g, h = 1,..., Hj, where Hj is the number of strata in Uj, j = 1,.. 
estimator has the form 



M j 

E 

h=l 



E S'*’ 

k£Sj : h, 


J. Thus the standard 


where Nj t h = #(Uj t h)> nj,h = fi( s j,h) and Sj : h denotes the sample chosen from Uj^, h = 1,..., Hj, 
j = 1,J. 

Recall that its variance is 


h =1 


c2 


where is the population variance in U jjh , that is S 2 j h = N , 1 h _ l T,keu j}h (Vk ~ Vu^h) 2 and y Uj h = 

Yjk£u jth yk- 

The problem we study is to allocate the sample of size n among subpopulations and strata in such 
a way that precision (expressed through CV) of the estimation is the same and the best possible in all 
subpopulations. That is we would like to find the two-way array (rij t h)h=i,...,H such that 

j I./' 


J H j 

EE n i’ h = n (9) 

3=1 h= 1 

and 

Hj 

E NjJh&j* = T, j = (10) 

h= 1 

with minimal possible T, which, actually, is the square of the CV. Then the double array (rij.h) is called 
the optimal equal-precision allocation in strata. 

Define 


Hj 

E 

h =1 


N 2 ,S 2 , 

,7 ,h j,h 

n j,h 


Aj,h 


HjA 


Hj 

Nj hS 2 h 

c i := x i’ h := x := n - 

h= t J 


For such Ajfi and cj define a and c as in ([B and note that d = J in the present setting. Let D be as 
defined in © for such a and c. Then directly from Theorem 12.11 we obtain an allocation which ensures 
both optimal and the same CV’s (precisions) of estimators of means in subpopulations: 


Theorem 2.3. Assume that the matrix D, as defined above, has a unique positive eigenvalue A. 
Then the optimal equal-precision allocation under the constraint © is 


Tij,h = n 


E l 


Vj Nj f fi Sj ,h/tj 

= 1 (E f=l N k , g S k , g ) /tk’ 
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where v = (v i..... vj) T is the eigenvector of D associated to X (with all coordinates of the same sign). 
Moreover, the common optimal value of the square ofCV’s T = A. 

Remark 2.1. Assume that the overall sample size n in ([9]) satisfies 


n < > --77-— 

St Ekii N i,kS] b 


( 11 ) 


Let D be the J x J matrix defined through ([2]) with a and c as above. Then by Proposition 12.21 it follows 
that assumptions of Theorem [23] are satisfied. Therefore the allocation using the eigenvector as given in 
the thesis of Theorem l2.3l is correct. 


Remark 2.2. Consider SRSWOR in each of subpopulations and the question of allocation (rii,..., nj ) 
under the constraint J2j =i n j = n > where rij denotes the size of the sample in Uj,j = 1,..., J. 
Moreover, 


N]S) 

tjrij 


-hN jS] = T , j = 


n 


( 12 ) 


where Nj = #{Uj) and S] = ^ Jf k£Uj (.Vk ~ Vu with y U;j = ^ Y,kcU :i Vk,j = !,■■■, J- 

This situation is embedded in the one we considered in Theorem l2.3l bv taking Hj = 1, j = 1,..., J. 
Then the vectors a and c od Corollary 12. 2 1 are of the form 



Note that the assumption ([ITT) is automatically satisfied since its right hand side equals N (> n). 
With v denoting the eigenvector from Theorem [23] we obtain 


rij = n 


v^NjSi/tj 

Ei=i ViNiSi/U ’ 




Alternatively, we can follow the approach described in Introduction: T can be obtained as the unique 
solution of equation 


n =J2 

3 = 1 


. 

T tj+NjS'j ’ 


then 


rii = m.ir j = 


(13) 


(14) 


j T+NjSj /t'j ’ 

Remark 2.3. An alternative approach to the general situation in which sub-populations are stratified 
is via the optimal Neyman allocation in each of sub-populations. That is we assume that for any i = 

1J 

n 3,h= n j ^wf -• 


Eg2l Nj,gSj,g 


h = 1 ,Lj 


(15) 









were n\ + ... + nj = n. Therefore, ( fi~0l ) leads to 



( 16 ) 


Therefore (similarly as in Introduction) we can solve (fT6l) for nj, j = 1,... ,J. Then using the constraint 
for the overall size of the sample we arrive at the equation for unknown T 



(17) 


Note that under condition (fill) a unique solution for T exists. It is obtained numerically. Then rij is 
obtained from (fT6l) and finally n ? j ); can be computed from (IT5l) . One can also derive the equation (fTTT) by 
minimizing the sample size in each subpopulation nj = rij t i + ... + n 3 .h- , j = 1,..., J, subject to a 
common precision T (see Sarndal, Swensson, Wretman (1992), Ch. 3.7.3). 

3 Equal-precision optimal allocation in subpopulations in two-stage sam¬ 
pling 

In this section we consider optimal equal-precision allocations under two-stage sampling. In the case 
of stratification on both stages and stratified simple random sampling without replacement (SRSWOR) 
we improve the result from NW in two directions. First, we relax some technical assumptions which 
originally were designed in order to use directly the Perron-Frobenius theorem on eigenvalues of matrices 
with positive entries. Second, we allow more flexible sampling designs, as stratified SRSWOR on both 
stages or Hartley and Rao (1962) systematic 7rps scheme at the first stage and SRSWOR at the second. In 
NW stratification was allowed either at the first or at the second stage, but not at both. Additionally, we 
consider a particular case of fixed sizes of samples of SSUs within PSUs - such an additional restriction 
is sometimes imposed in real surveys, see e.g. Lyson et al. (2013), p. 28-30. 

Similarly, as in the previous section we start with a more general minimization problem. 

3.1 General minimization problem 

In this section we consider and solve a minimization problem which unifies a wide class of optimal 
allocation problems with the same precision in subpopulations. The approach is similar to the previous 
section, however we have two vectors to allocate: one responsible for allocation of PSUs and one for 
allocation of SSUs. The direct numerical method as described in Remark [T2l does not work in such two- 
stage setting. The reason is that there is no way to express the elements of allocation vectors in terms of 
unknown common precision T. Consequently, analogues of (fl3l) and (fl4l) or ( fi~6l ) and ( 1771 ) are no longer 
available. 
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Consider real numbers: Cj > 0, Ajfi 0, Bjhi ^ 0, otjhi > 0, i €E Vj, h , h — 1,..., Hj, 
j = 1,..., d, x, z > 0. Denote 


Hi 


a = ( aj, j = !, • • •, d) = -4 ^ VA/Vo j = 1,..., d , 


(18) 


. /i=i 


Hi 


h. (Pji j 1, . . . , j ^ ^ ^ ^ sj Bj,h,ii j 1)- • > d 

\h =1 


Let 


D = aa T + bb T — diag(c). 


C= (Cj, j = l,...,d). 

(19) 

( 20 ) 


Theorem 3.1. Assume that D has the unique positive eigenvalue A > 0 and let v = (vi,, v c i) be a 
respective eigenvector. Then the problem of minimization of 


Hi 


°< T = Ei -v+E 


Zj.h.i 


Cj, j — 1,... ,d, 


h= 1 


iev 


j,h 


where Xj^h > 0, Zj^,i > 0, i € h = 1,... , Hj, j = 1,... ,d, under the constraints 

d Hj 

x i’ h = x 

j =1 h =1 

and 

d Hj 

y! yi x j,h a j,h,x z j,k,i= 

i=i ti=i »eVj,h 

where x and z are given positive numbers, has the solution 


( 21 ) 


( 22 ) 


(23) 


= x 


V is/ A i,h 

THT, 


Etl Eg=l v^’ 


and 


Z 3,h,i ~ x- h 


3 \ 


3,h Efc=l v k Eg = l Ei S V fc \J a k,g,l Hk,g,l 


(24) 


(25) 


Moreover, T = A, the unique positive eigenvalue of matrix D. 
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Proof. The proof adapts the argument used in the proof of Theorem 12.11 to the more complex situation 
of Theorem l3.ll 

Consider the Lagrange function 


Hi 


3 = 1 


. h=l 


E 

ieVj,h 


z j,h,i 



J Hi j Hi 

+/r £ S X Ph + V Y Y X i’ h Y a 3,h,i^,h,i- 

3 =1 /i=l 3 = 1 h=l i€Vj,h, 

Differentiate with respect to Xj^ and Zj^,i to get equations for stationary points: 


dF _ _ A i 
d x i,h xT 


3,h 


A 3,h+ 


B 


j,h,i 


i ev 7 . 


z j,h,i 


"T P -f" V ^ ' &j,h,i z j,h.,i 0 


and 


From (1271) we have 


B j,h.i + j, r ., n ., . — n 
rt - 1 ... . . rl'dfj hlXj h i o. 


_ / Aj Bj th j 


X j,h z j,h,i ~ y va jth<i 

Inserting the above expression into (l23l) we obtain 




^ “ 2 V a 3,h,i 

j =i ft=t 


On the other hand, inserting it into (1261) . upon cancelations, yields 


Returning now to (1221) we end up with 


Hi 


VJi = l E vT E \/t: 

i=i h=i 


/l* 


Returning to (I2TT) we get 


Hi 


Hi 


x :j "Y/ YYj + JZ Y1 Y1 V a 3,h,i Bj,h,i c j — 1 ■ 


h= 1 


(26) 


(27) 
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Multiply by Vj := ^f\j and plug in the formulas for yffi and \Jv to arrive at 


yi v A hj 

h =1 

+ 5 I Vk EE EE V a k,g,l Bk,g,l I EE EE \Ak/',(M Bj,h,i — c j v j = Tvj. 

\k =1 g= 1 J h= 1 iGVj^h 

Note that the above equation is equivalent to 

Dv = Tv, 

that is 0 < T = A is the unique positive eigenvalue and v is the eigenvector related to T. 

To show that the eigenvector v attached to the eigenvalue A has all coordinates of the same sign the 
argument is exactly the same as in the last part of the proof of Theorem l2.ll □ 


'/ a ks 

k=l g=l 


H k 


Hi 


Remark 3.1. In practical applications it is often important that instead of a constant T there are priority 
weights Kj > 0, assigned to each constraint (I2TT) . j = 1,... ,d. That is, T is replaced by k ; T at the left 
hand side of (I2T1) for every j = 1..... d. Note that this situation can be rather trivially reduced to the one 
considered in Theorem 13. 11 This is done by dividing both sides of the jth new constraint equation (with 
KjT at the left hand side) by Kj. Then we obtain (I2TT) with .4^/,, /j ? / M and Cj changed, respectively, into 
AjJi/kj, Bjhj/Kj and Cj/Kj. The vectors a and b given in (fl8l) and (fl9l) and the matrix D in (l20l ) have 
to be updated similarly. Consequently, the solution of the minimization problem as given in (l24l) and 
(1251) refers to the eigenvector v of such updated matrix I): moreover in the formulas ( 1241 ) and (1251) all the 
quantities of the form Aj^, Bj,h,i have to be changed into Ajh/Kj, Bj^j/nj- 


Since D is a rank two perturbation of a diagonal matrix one may use Weyl inequalities to establish 
conditions under which D has a unique positive eigenvalue. Such a sufficient condition is given below. 


Proposition 3.2. Let a, c € (0, oo) d and b e [0, oo) d be such that 


E 



E 

i,i =l 
*74 ? 


(■ aibj-ajb ;) 2 


> 1. 


(28) 


Then the matrix D defined in (1201) has a unique positive eigenvalue A. 

Proof. By Ai(X) < ... < Xd{X) we denote eigenvalues of d x d matrix X. Take A = aa T + bh 1 and 
B = — diag(c). Since A is of rank at most 2, we have Aj_ 2(44) = 0. Consequently, the Weyl inequality 
(121) with k = d — 2 implies Xd- 2 (D) < 0 since all eigenvalues of B arc negative. 

Therefore, 

sgn det D = (-l) d-2 sgn((5 d _i(5 d ). (29) 
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On the other hand expanding determinant of D 


det D = det 


°i + &i — c i aia 2 + b\b 2 ... aia d + bib d 
aia 2 + b x b 2 a% + b\ - c 2 ... a 2 a d + b 2 b d 


a\a d + bib d a 2 a d + b 2 l 


a d + b d ~ c d . 


and using the fact that aa T + bb is of rank at most two we obtain 

d d d d d 

det D = (—l) d 2 ^ " (aibj — ajbi ) 2 Ck + (—l) d 1 ^ ] (&? + bf) Ck + (—l) d c& 

2—1 k =1 k =1 


i,j =1 

*7O' 


fc=l 


fc=i 

k^i 


< 1 n ck 


k =1 


d d 

'-E^ + E 


0 aibj-ajbi) 2 


i =1 


*J=1 
*7O' 


Comparing the above formula with (l29l) we have 


sgn(i5 rf _i)sgn((5 rf ) = sgn 


i-E^+E 


( aibj-ajbi ) 2 


C^Cj 


2=1 


»,j=i 


which is negative due to (1281) . Since Ad_i(Z?) < X d (D) then necessarily Ad_i(Z?) < 0 < X d {D) := 
X. □ 

Note that if of + bf > a, i = 1 ,,d, then D is a matrix with all positive entries and the result 
of Theorem 13. 2l holds by the Perron-Frobenius theorem. Under such assumption there are situations in 
which condition (l28l) may not be satisfied, that is Theorem [Undoes not cover fully this classical case. 


3.2 Application to stratified two-stage sampling 

The population U consists of subpopulation U\,... ,Uj. Each subpopulation Uj is partitioned into 
primary sampling units (PSU’s) which are structured into strata Wj t h, h = 1,... , Hj, j = 1 ,,J. 
Each primary unit i € W’ ; j, consists of secondary sampling units (SSU’s) which are again structured 
into strata Wj,h,i.g, g = 1,..., Let Mjj, be the number of PSU’s in Wy./, and Nj t h,i,g be the 

number of SSU’s in Wj t h,i, g - In the schemes we describe below stratified SRSWOR is used at the second 
stage. 
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3.2.1 SRSWOR at the first stage 

The sample is chosen as follows: At the first stage a sample S of size m. ; j, of PSU’s is selected from , 
h = 1,..., Hj, j = 1,..., J, according to SRSWOR. At the second stage a sample S of size nj ^g of 
SSU’s is selected from Wj^,i, g , 9 = 1, • • •, only for PSU’s i € S, according to SRSWOR. 

The variance of 7r-estimator of the total of y over subpopulation Uj has the form, see, e.g. Sarndal, 
Swensson, Wretman (1992), Ch. 4.3 


Hi 


Hi 


Gjh,i 


h= 1 


E E (t-E wh 

h= 1 ieWj,fc 3=1 


N 2 c 2 
1 y j,h,i,g u j,h,i,g 


where 


with 


and 


with 


n 2 _ _ 

' U' “ Afj-h-l 


*e>w,h 

U y k V PSU’s V- and t j:h = ^ U 


keVi 


iew. 


i,h 


c2 _ _ 


UFT E fo* - h 


h,i,g) 


k€W jth . 


t j,h,i,9 ~ JV, L . E 


:%■ 


fc 6W a ,j, 


We assume that the size of the PSU’s sample 5 is 


J 

E E m J* = m 

j =i /i=i 


(30) 


and expected size of the SSU’s sample 5 is 


«/ Hj Gj,h,i 

E E St E E = "• 


(31) 


i=i ti=i 9=1 

Additionally we assume that the precision of 7r-estimator in every subpopulation is constant, that is 


Hj 

E 

/i=l 


j,h,i 


M .i,h 

m j,h 


TjA + E E 

tGWj.fc 2=1 


Pj,h,i,g 


n . h ~ I “Cj = T, j = 1,... ,J, 


(32) 
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where 


Gj 


ij,h - yi | M]. h D 2 J:h - 

g= 1 


N, 


j,h,i,g D j,h,i,g I > 


P : 


'j,h,i,g 


1 /\T 2 c 2 

T? ly j,h,i,g u j,h,i,g 


Hj 

and c j = ^Y,M j , h D l h , 
3 h =l 


.* 

Now we use Theorem [3J] with V j)h = U iG w, Vl where = {Wj, 9 = 1, ■ ■ •, £?>,*} 

is the set of strata of SSU’s in the ith PSU of Wy^, and with 


Aj,h ^j,h1j,hi Bj,h,(i,g ) Mj,hPj,h,i,gi tXj,h,(i,g ) ^j,h’> ^ ^ 


-1 


In the above formulas we identified i from Theorem [3j] with the pair (i. g ) in the special setting consid¬ 
ered here. Directly from Theorem 13. II we get the following result: 


Theorem 3.3. Assume that the matrix D defined by (1201) has the unique positive eigenvalue A. 

Assume that the overall PSU’s sample size is rn and the expected overall SSU’s sample size is n. 
Assume that 7 j^ > 0, h = 1,..., Hj, j = 1,,J. 

Then the optimal equal-precision allocation in strata is 


v iV M j,hTj,h 

m,-/, = m —j -77- 

ZLi «kEr=i y/M k , hlk , h 


h = 1,... , Hj, j = 1,..., J, and 




n- 


v j Mj,h \/ Pj,h,i, 1 


rm 


i,h J2k=l V k J2r =1 YlleWk r J^s=l ' 1 yfPi 


Gk,r,l 


k.r.l.s 


g = 1,. .., Gj^j, i € Wj./j, h = 1,..., Hj, j = 1,..., J,where v = (17, ..., vj) T is the unique 
eigenvector with all coordinates of the same sign of the matrix D. 

Moreover, the common optimal value of the square of precisions (CV’s) T = A, the unique positive 
eigenvalue of D. 


Remark 3.2. In the case of unequal predesigned precisions in subpopulations described in terms of 
priority weights Kj assigned to each subpopulation, as pointed out in Remark [37T] we need to change 7 j^ 
into 7 j,hl ttj and Pj,h,i,g into Pj,h,i,g/ K j in Theorem 13.31 Similarly, c :} changes into Cj / Kj and the matrix 
D, see (l20l) . and its eigenvectors and eigenvalues will be automatically updated. The same applies to Th. 
l3~4l Th. [33] and Th. 331 below. 
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3.2.2 Fixed SSU’s sample size within PSU’s 


To avoid the situation when SSU’s sample size is random one can postulate that within PSU’s in a given 
stratum Wj.h it is constant, that is there arc numbers (to be found) rijj, denoting SSU’s sample size for 
any i € VV ; ./,, j = 1,..., J, h = 1,..., Hj. Here we assume SRSWOR with no stratification at the 
second stage. Therefore, while the constraint (l30l) remains untouched the constraint (|3TI ) changes into 


J Hj 

^2 ^2 m 3 ,h n i,h = n. (33) 

3 =1 h= 1 


The requirement of the common precision yields 


Hj 

\( \ 


T _ 1 V Mj ' h 

T? m j h 

\Mj,nDl h - Y. N ^Alk,i 

+ N j,h,i S lh,i 

3 h =1 

_ \ i£Wj, h / 



Refemng again to Theorem 13. II we take # Vjj, = 1, 


Hj 


E 

h= 1 


^-j,h ^j,hUj,ht 


where 


7?,ft 


M. 


j,h 


Dlk- E N i*A: 


j,h,i 


ieWj,h 


Bj,h, 1 — 


where 


Pj,h 


1 



*22 N j,h,i^j,h,V 

i&Wj.h 


Hj 

c i = T? *22 M i’ hD lhi a j,h,l = !, X = m, z = n. 

3 h =t 

Consequently, directly from Theorem 13. II we have the following result: 

Theorem 3.4. Assume that the matrix D defined by (1201) has the unique positive eigenvalue A. 

Assume that the overall PSU’s sample size is m and the expected overall SSU’s sample size is n. 
Assume that 7 j^. > 0, h = 1 ,,Hj, j = 1,... ,J. 

Then the optimal equal-precision allocation is 


v jV M j,hlj,h 

rris h = rn ---- r— - — . 

TL 1 n Efil VWUU 

and 

Vj \/ 

n ih. = n - j— - Tr - - ■ 

mj,h Efe=1 v k Er=l \/Mk,rfik,r 

h = 1,..., Hj, j = 1,..., J,where v = (v\,... , vj) T is the unique eigenvector with all coordinates of 
the same sign of the matrix D. 

Moreover, the common optimal value of precision T = A. 


16 










3.2.3 Hartley-Rao scheme at the first stage 

The sample is chosen as follows: at the first stage a sample S of PSU’s is selected according to a 7rps 
sampling introduced by Hartley and Rao (1962), known also as 7rps systematic sampling from randomly 
ordered list. This sampling procedure is applied in each of the strata of the PSU’s. That is an addi¬ 
tional auxiliary variable Z is given in the population of PSU’s. Each stratum VV ; /, is randomly ordered 
and then rrij h PSU’s arc chosen by systematic sampling through m ? /, — 1 jumps of the size one start¬ 
ing at the random point from the interval [0,1]. Such a procedure gives a random selection of points 
xi,..., x mj h € [0, rn j;h \. We have 


n j,h,i ~ m j,h,Zj,h,ii where Zj,h,i z \ an d Zj^ — 


for any PSU i from hth stratum from jth subpopulation. The sample of PSU’s in Wj /, consists of such 
PSU i’s that 

/ i —1 i \ 


z j,h 


fr x k€ 


zi 


k = 1 , • • •, rn.j.h 


\i =t l=i / 

for some i = 1,..., At the second stage a sample S of SSU’s is selected according to SRSWOR {nj,h,i,g) 
from Wj t h,i, g , <7 = 1,..., Gj.h.i, only for PSU’s i € S. 

The approximate variance of 7r-estimator of the total of y over subpopulation Uj has the form, 
see Hartley and Rao (1962) (their formula (5.17) for the simplified variance of the 7r-estimator for the 
systematic 7rps sampling and Sarndal, Swensson, Wretman (1992), Ch. 4.3, for the variance in two-stage 
sampling) 




Hj 



Hi 

y — 

mj'h 

E 

%j,h,i ) 

-E E 

h= 1 

ieWj.h 


h= 1 ieVVj, 

Hj 


Gj,h,i 


+ V 

v 

1 r f i 

1 

Z—j ™j, h 

2. j 

Zj.h.i / \ n j,h,i, 

g Nj,h,i,g 

h= 1 

ieWj, h 

9=1 



N~ h „• „S~, 


where 


and 


with 


u 3,h,i ~ Zj,h,i yj,h) ar *d Dj,h ~ ^ Vi 


i£W. 


i,h 


Sj,h,i,9 ~ JVj-hi,-1 X/ ( yk 




t j,h,i,g ~ X] Vk ' 


k£Wj,h,i,g 
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We assume that the size of the PSU’s sample S satisfies the constraint (l30l ) while the formula (I3TI) 
for expected size of the SSU’s sample assumes the form 




J 

X! X m 3,h X Zj,h,i X n j,h,i,g 
3 = 1 h =1 iGVVj-.h 9=1 


= n. 


(34) 


Denote 

Dj,h = 'y ^ + Zj,h,i)- 

ieWj, h 

Additionally we assume that the precision of 7r-estimator in every subpopulation is constant (actu¬ 
ally, since we use an approximate formula for the variances, it is not precision but rather approximate 
precision), that is 


Hi 


G j.h.i 


^ m / j,h I ^^ Zj,h,i ^ ^ 


h =1 


fij,h,i,g 
j,h,i / 'H J j,h,i,g 
i€Wj,h 9 =1 


- Co = T, 


j = i, ■ ■ •, j, 


where 


G 


j,h,i 


lj,h ~ JV D j,h X z2 i X N 3,h,i,g Sj,h,i,g ’ 


(35) 


(36) 


ieWji 


9=1 




/d . . _ 1 ]\j2 c 2 


and 


C 9 “ T? 


feE E 




h =1 zGWj h 


for Tj — Yhh= l Yliew jth U, j — 1, ■ ■ ■, J- 

At this stage we again refer to Theorem 13. 11 once again using the identification i = ( i,g ). Thus we 
define 

Aj,h — lj,hi Bj,h,(i,g) — Zj t h,i ’ a 3,h,(i,g) ~ Zj,h,ii x ~ m i Z — Tl. 

As a conclusion from Theorem 13. II we have the result describing (approximate) optimal equal-precision 
allocation in the scheme considered in this subsection: 

Theorem 3.5. Assume that the matrix D defined in (1201) has a unique positive eigenvalue A. Assume 
that the overall PSU’s sample size is rn and the expected overall SSU’s sample size is n. Assume that 
1j,h > 0, h = 1,.. ., Hj, j = 1 

Then the (approximate) optimal equal-precision allocation in strata is 


m j,h = 


m ■ 


v js/fjA 


£fc=l v k Hf= 1 y/lk,. 
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h = 1, ..., Hj, j = 1, ..., J and 




n- 


v j VPj,h,i,g/ z j,h,i 


m j,h Yhk= 1 V k J2r =1 J2leW k r J2s=l’ 1 ^ 


( 6 Wk,r 


Gk,r,l 


Jk.r.l.s 


g = 1,..., Gj } h,i, i € yVj.h-, h = 1,..., Hj, j = 1,..., J,where v = (t>i,..., vj) T is the unique 
eigenvector with all coordinates of the same sign of the matrix D = aa T + bb 1 — diag(c). 

Moreover, the common optimal value of precision T = A, the unique positive eigenvalue of D. 


3.2.4 Fixed SSU’s sample size within PSU’s 

Similarly as in the previous section we consider now the situation when samples sizes of SSU’s are 
fixed for PSU’s belonging to the same strata within subpopulation. Then the constraint for common 
(approximate) precision for all subpopulation reads 


where 


Hi 


^2 m ih I + n - h ^Y 


h =1 


7 ,h / Zj.h.i 

i£Wj,h 


Cj — T, j — 1) ■ ■ ■; Jj 


Gjh.i 


li,h - ( Df h - Y l2~ N hh,i,g S lh,i,s 

g =1 


Hi 


Pj,h,i ~ J? N j,h,i Sj,h,i and c j ~ JV Y, ^2 Z j>h,i u j,h,i- 


h=l iGVVj'h 


The constraints regarding sizes of samples assume the form 


J H j 

^2 Y^ rn :i- h = 171 and 
3 =1 h =1 

In Theorem [3J] we take # V y /, = 1, 


J Hj 

Y ^2 m hh n 3,h = 

3 =1 h =1 


Aj,h ~ lj,hi 


-Bj,h - Bq h 1 - 


j,h, 1 


E 

ieWj.h. 


Pj,h,i 
Zj,h,i 5 


Cj as above and aj h,i = 1, x = m, z = n. Consequently, Theorem 13.11 gives the following result: 

Theorem 3.6. Assume that the matrix D defined in (120b has a unique positive eigenvalue A. Assume 
that the overall PSU’s sample size is rn and the overall SSU’s sample size is n and the sample SSU’s 
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sizes n h f, are fixed (but unknown) within strata in subpopulations. Assume that 7 > 0, h = 1,..., Hj, 
j = 

Then the (approximate) optimal equal-precision allocation in strata is 


m 


j,h vn j 


Efc=1 V k Eg =1 


/i = 1, ..., Hj, j = 1,..., J and 


v j \[Wh 

Tin h = ttl -7-77- . 

m j,h Efc=l v k Er=l y/Bk,r 

h = 1 ,Hj, j = 1,..., J, where v= (v±,..., vj) T is the unique eigenvector with all coordinates 
of the same sign of the matrix D = aa T + bb T — diag(c). 

Moreover, the common optimal value of precision T = A, the unique positive eigenvalue of D. 


4 Numerical experiments 

In the experiments, described below, we used the method developed in Subsection 13.2.31 to analyze 
optimal equal-precision allocation in the Polish Labour Force Survey (LFS). 

An artificial population has been created on the basis of results of a sample survey which accompa¬ 
nied the last virtual census in Poland. The sample for this survey was drawn through stratified sampling 
with strata at a NUTS 5 level (we follow the Eurostat standard NUTS nomenclature of territorial units for 
statistics in EU; in Poland it refers to the level of municipalities). This sample covered 20% dwellings 
in the country. To create the population for simulation purposes the records of the sample where cloned 
together with data related to surveyed persons in each dwelling with the cloning multiplicity equal to 
rounded corrected weights for dwellings. As a result an artificial population of 13 243 000 households 
was constructed. 

The sampling scheme in the experiment was exactly the same as in the LFS survey, that is; two-stage 
with stratification at the primary units level. In each such strata of primary units the sample was drawn 
according to the standard Hartley-Rao method, with first order inclusion probabilities proportional to the 
number of dwellings in PSU’s. The definition of PSU was based on the one adapted in the LFS as well. 
That is, in urban regions PSU’s were identified with so called census clusters and in non-urban areas they 
were identified with enumeration census areas. The SSU’s were just households. The strata definition 
for PSU’s were adapted from the LFS, which resulted in the total of 61 strata of PSU’s. Taking into 
account one of the four subsamples for the quarter of year used in the LFS (one of the two which are 
new in the survey) the total number of PSU’s and SSU’s was designed to be m = 1872 and n = 13676, 
respectively. 

On the basis of such pseudo-population (with the variable "number of unemployed in the household" 
transferred from the 20% survey, which accompanied the last virtual census) suitable initial parameters 
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for the procedure described in Subsection 13.2.31 were prepared. The variables: number of dwellings in 
PSU’s and number of unemployed in a household (SSU) were essential in constructing matrices a a T , 
b 6 r and diag(c). As subpopulations the NUTS2 level (in Poland it refers to voivodships) was used. The 
standard R function eigen for numerical computation of eigenvalues and eigenvectors was used - see R 
Core Team (2013). Examples of R- codes we used for optimal equal-precision allocation are available at 
https://github.com/rwieczor/eigenproblem_sample_allocation. Theoretical value of CV defined through 
the maximal eigenvalue was numerically found to be about 9.7%, which is almost exact value of the 
square root of the largest eigenvalue, as it should be according to the theoretical results obtained in 
previous sections. The optimal equal-precision allocation, based on the eigenvector related to the largest 
eigenvalue, was a base for drawing samples of PSU’s and then of SSU’s. The experiment was repeated 
independently 100 times with the average PSU’s sample size equal to 1872 and the average SSU’s sample 
size equal to 13669. In each experiment, like in the original LFS survey, the precision of estimates of 
the variable "number of unemployed at NUTS2 levels" was evaluated through a bootstrap method. One 
of the variations of the bootstrap method was used, where in each stratum a multiple resampling (in this 
case 500 times) takes place with replacement of rih — 1 subsamples out of rih PSU’s selected for the 
survey in the hth stratum - see McCarthy and Snowden (1985) (described also in the monograph Shao 
and Tu (1995), Ch. 6.2.4). After resampling the original weights for sampling units are properly rescaled 
and bootstrap variance estimate of the corresponding indicator is obtained by the usual Monte Carlo 
approximation based on the independent bootstrap replicates. These results were compared to other 
100 independent experiments in which the sample was drawn from the pseudo-population according 
to the standard LFS procedure, which is thoroughly described in Popinski (2006). Actually, we used 
a simplified version of the standard procedure used in the LFS, because we did not take into account 
the fact that the real sample consists of four elementary subsamples together with a rotation scheme. 
Instead, we considered only one of such four elementary subsamples. The same variable was estimated 
and the precision was evaluated again through the bootstrap procedure. The means of the result over 100 
independent experiments are gathered in Table 1 below. One can easily observe that the proposed new 
procedure gives an average about 14% gain in CV, when compared to the standard LFS procedure. 


5 Conclusions 

The method of eigenvalue and eigenvectors was applied to optimal equal-precision allocation in two- 
stage sampling for the first time in Niemiro and Wesolowski (2001). In the present paper we emphasize 
its versatility by considering more abstract setting covering also single-stage sampling (in Section 2) 
and wider family of two-stage sampling schemes (with stratification at the second stage). In particular, 
Hartley-Rao sampling at the first stage is taken care of. Additionally the case of constant SSU’s sample 
sizes within PSU’s from the same strata is covered by the proposed general methodology. In general the 
approach is based on looking for a unique positive eigenvalue of a matrix, which is properly defined in 
terms of population quantities. This matrix appears to be a low-rank (< 2) perturbation of a diagonal ma- 
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NUTS2 

standard 

standard 

optimal 

optimal 

standard 

optimal 


SSU allocation 

PSU allocation 

SSU allocation 

PSU allocation 

CV 

CV 

PL11 

884 

130 

888 

127 

10.6 

9.4 

PL 12 

1170 

156 

1033 

153 

10.8 

9.5 

PL21 

884 

130 

861 

130 

10.9 

9.4 

PL22 

1170 

182 

999 

114 

9.2 

9.6 

PL31 

754 

104 

813 

117 

11.2 

9.5 

PL32 

702 

104 

661 

96 

10.2 

9.7 

PL33 

676 

78 

731 

117 

12.4 

9.5 

PL34 

832 

78 

882 

120 

11.8 

9.5 

PL41 

936 

156 

927 

129 

10.3 

9.5 

PL42 

806 

104 

836 

112 

11.7 

9.7 

PL43 

572 

78 

814 

95 

12.3 

9.9 

PL51 

910 

130 

880 

113 

10.6 

9.7 

PL52 

988 

156 

942 

133 

10.3 

9.5 

PL61 

728 

104 

779 

101 

10.9 

9.6 

PL62 

780 

78 

754 

103 

11.0 

9.6 

PL63 

884 

104 

869 

111 

11.0 

9.6 

Sum 

13676 

1872 

13669 

1871 




Table 1: Comparison of allocations and precision between standard and optimal procedures in the LFS 
on the basis of numerical experiments for a census-based pseudo-population 
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trix. It is proved that the eigenvector associated with the unique positive eigenvalue of this matrix has all 
components of the same sign. Both the eigenvalue and the eigenvector can be calculated using standard 
procedures, see e.g. the R-code we posted at https://github.com/rwieczor/eigenproblem_sample_allocation 
After the eigenvector is known the optimal equal-precision allocation is then derived easily. The numer¬ 
ical example shows that application of the proposed method to Polish LFS improves CV of estimates 
for subpopulations by 14% on average, when compared to the standard allocation used at present in this 
survey. 

The allocation procedures and formulas developed above, similarly as the classical ones, depend on 
population quantities as Sj h ■ which, by rule are unknown, and maybe difficult to estimate e.g. from 
previous surveys. Then a possible approach would be to adopt some model assumptions and replace 
Sj hi g ’s by their model expectations (as done e.g. in Clark (2009) in a somewhat different setting of 
the problem, when subpopulations may cut across PSU’s). An alternative approach would be to refer to 
auxiliary variable X, which is correlated with the variable of study and available for all PSU’s and/or 
SSU’s in the population from administrative registers and using S? hi (A’)’s instead of S'j h ■ ’s. 

Finally, we mention that the method, we developed in this paper allows us to incorporate different 
predesigned subpopulations levels of precision priority rtj > 0, j = 1,..., J, as described in Section 3. 
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